Adaptive Combination of Behaviors in an gent

نویسندگان

  • Olivier Buffet
  • Alain Dutech
  • François Charpillet
چکیده

Agents are of interest mainly when confronted with complex tasks. We propose a methodology for the automated design of such agents (in the framework of Markov Decision Processes) in the case where the global task can be decomposed into simpler -possibly concurrentsub-tasks. This is accomplished by automatically combining basic behaviors using Reinforcement Learning methods. The main idea is to build a global policy using a weighted combination of basic policies, the weights being learned by the agent (using Simulated Annealing in our case). These basic behaviors can either be learned or reused from previous tasks since they will not need to be tuned to the new task. Furthermore, the agents designed by our methodology are highly scalable as, without further refinement of the global behavior, they can automatically combine several instances of the same basic behavior to take into account concurrent occurences of the same subtask.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An investigation into the relationship between care burden and adaptive behaviors of mothers of children with thalassemia

Abstract Background and Objectives Mothers of children with thalassemia as primary caregivers face problems with treatment and care issues. These problems as consequences of child’s illness often lead to inappropriate adaptive behaviors. The aim of this study was to disclose the relationship between caregiver burden and adaptive behaviors in mothers of children with thalassemia.   Materials a...

متن کامل

بررسی مقایسه ای رفتار انطباقی در کودکان ناشنوا و کودکان با شنوایی طبیعی 12 تا 36 ماه

Introduction: Hearing loss in children is a main cause of malfunction in them. On the other hand, adaptive Behavior includes the age-appropriate behaviors necessary for people to live normally and in daily life. The objective of this study was to compare the adaptive behaviors of 12-36 months deaf and normal children. Methods: In this case- control study, we compared adaptive behaviors score o...

متن کامل

Combination of Adaptive-Grid Embedding and Redistribution Methods on Semi Structured Grids for two-dimensional invisid flows

Among the adaptive-grid methods, redistribution and embedding techniques have been the focus of more attention by researchers. Simultaneous or combined adaptive techniques have also been used. This paper describes a combination of adaptive-grid embedding and redistribution methods on semi-structured grids for two-dimensional invisid flows. Since the grid is semi-structured, it is possible to us...

متن کامل

Optimized computational Afin image algorithm using combination of update coefficients and wavelet packet conversion

Updating Optimal Coefficients and Selected Observations Affine Projection is an effective way to reduce the computational and power consumption of this algorithm in the application of adaptive filters. On the other hand, the calculation of this algorithm can be reduced by using subbands and applying the concept of filtering the Set-Membership in each subband. Considering these concepts, the fir...

متن کامل

Mediating Role of Emotion Regulation in the Relationship of Metacognitive Beliefs and Attachment Styles with Risky Behaviors in Children of War Veterans with Psychiatric Disorders

Objectives: High-Risk behaviors among adolescents are a major concern for mental and social health. The purpose of this study was to investigate the mediating role of adaptive and maladaptive emotion regulation strategies in the relationship of metacognitive beliefs and attachment styles with risky behaviors among adolescent children of war veterans with psychiatric disorders. Method: This is ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002